Generation of robust phonetic set and decision tree for Mandarin using chi-square testing

Authors

  • Yeou-Jiunn Chen
  • Chung-Hsien Wu
  • Yu-Hsien Chiu
  • Hsiang-Chuan Liao
Abstract

A phonetic representation of a language is used to describe the corresponding pronunciation and synthesize the acoustic model of any vocabulary. A phonetic representation with smaller phonetic units such as SAMPA-C for Mandarin Chinese and decision trees for parameter sharing are broadly applied to deal with the problem of large numbers of recognition units. However, the confusable phonetic representation in SAMPA-C generally degrades the recognition performance. In this paper, a statistical method based on chi-square testing is used to investigate the phonetic unit characteristics that are confusing and develop a more reliable phonetic set, named modified SAMPA-C. A corresponding question set for the modified SAMPA-C and a two-level splitting criterion are also proposed to effectively and efficiently construct the decision trees. Experiments using continuous Mandarin telephone speech recognition were conducted. Experimental results show that an encouraging improvement in recognition performance can be obtained. The proposed approaches represent a good compromise between the demands of accurate acoustic modeling and the limitations imposed by insufficient training data.
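The abstract does not spell out how the chi-square test is applied, so the following is only a minimal sketch of one plausible reading: a Pearson chi-square test of independence on a 2x2 table of recognition counts for a pair of phonetic units. The function names, the counts, and the decision rule (treating a pair as confusable when independence cannot be rejected at the 0.05 level) are all illustrative assumptions, not the authors' procedure.

```python
from typing import List

# Critical value of the chi-square distribution for 1 degree of freedom
# at significance level 0.05 (standard table value).
CRITICAL_0_05_DF1 = 3.841


def chi_square_statistic(table: List[List[float]]) -> float:
    """Pearson chi-square statistic for an r x c contingency table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of rows and columns.
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed - expected) ** 2 / expected
    return stat


def is_confusable(table: List[List[float]]) -> bool:
    """Assumed rule: if the recognized label looks independent of the
    reference label, the two units are not being told apart."""
    return chi_square_statistic(table) < CRITICAL_0_05_DF1


# Hypothetical counts. Rows: reference unit A, B. Columns: recognized as A, B.
confusable_pair = [[52, 48], [47, 53]]
distinct_pair = [[90, 10], [12, 88]]

print(is_confusable(confusable_pair))
print(is_confusable(distinct_pair))
```

Under this reading, pairs flagged as confusable would be candidates for merging into a single unit of a reduced phonetic set; the actual merging criteria used for the modified SAMPA-C are described in the paper itself.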

Similar resources

Using Chi-Square Testing in Modeling Confusion Characteristics for Robust Phonetic Set Generation

A phonetic representation of a language is used to describe the corresponding pronunciation and synthesize the acoustic model of any vocabulary. In order to obtain a better phonetic representation, context-dependent units are used to model co-articulation effects between phones and have been broadly used in speech recognition. However, this representation generally increases the number of recognition ...

Using Robust Decision Tree Construction for Continuous Speech Recognition

Context-dependent units based on decision trees have been broadly used to model co-articulation effects and speech variation in speech recognition. Decision trees are generally constructed in a data-driven way and guided by linguistic information that contains a priori phonetic knowledge. In this paper, a two-stage splitting criterion is proposed to effectively construct the decision trees....

Robust tests for testing the parameters of a normal population

This article aims to provide a simple robust method to test the parameters of a normal population by using the new diagnostic tool called the “Forward Search” (FS) method. The most commonly used procedures to test the mean and variance of a normal distribution are Student’s t test and Chi-square test, respectively. These tests suffer from the presence of outliers. We introduce the FS version of...

Irrelevant variability normalization in learning HMM state tying from data based on phonetic decision-tree

We propose to apply the concept of irrelevant variability normalization to the general problem of learning structure from data. Because of the problems of a diversified training data set and/or possible acoustic mismatches between training and testing conditions, the structure learned from the training data by using a maximum likelihood training method will not necessarily generalize well on...

Journal:
  • Speech Communication

Volume 38

Publication date: 2002